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Abstract 

The eigenvalue problem for 3x3 octonionic Hermitian matrices contains some sur- 
prises, which we have reported elsewhere [1]. In particular, the eigenvalues need not 
be real, there are 6 rather than 3 real eigenvalues, and the corresponding eigenvec- 
tors are not orthogonal in the usual sense. The nonassociativity of the octonions 
makes computations tricky, and all of these results were first obtained via brute 
force (but exact) Mathematica computations. Some of them, such as the computa- 
tion of real eigenvalues, have subsequently been implemented more elegantly; others 
have not. We describe here the use of Mathematica in analyzing this problem, and 
in particular its use in proving a generalized orthogonality property for which no 
other proof is known. 
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1 INTRODUCTION 



Finding the eigenvalues and eigenvectors of a given matrix is one of the basic 
techniques in linear algebra, with countless applications. The familiar case 
of Hermitian (complex) matrices is very important, for instance in quantum 
mechanics, where the fact that such matrices have real eigenvalues allows them 
to represent physically observable quantities. 
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The eigenvalue problem is usually formulated over the complex numbers C, 
including the reals R as a special case. In recent work [1], we considered the 
generalization to the other normed division algebras, namely the quaternions 
HI and the octonions O. Most of the basic properties are retained, provided 
they are reinterpreted to take into account the lack of commutativity of H 
and O, and the lack of associativity of O. However, there are a number of 
surprises, including the fact that such matrices admit non-real eigenvalues. 

Our most important results concern the eigenvalue problem for 3 x 3 octonionic 
Hermitian matrices, the Jordan matrices. It turns out [1] that such matrices 
admit more than the expected 3 real eigenvalues, and that eigenvectors corre- 
sponding to different eigenvalues fail in general to be orthogonal in the usual 
sense, although they do seem to be orthogonal in a generalized sense. 

Because of the lack of both commutativity and associativity, working with 
octonionic matrices is rather tricky. All of the above results were initially dis- 
covered using a Mathematica package we have developed over the years for 
just this purpose. While we have subsequently been able to derive more el- 
egant derivations by hand for some of these results, we have not succeeded 
in doing this for all. In particular, the only current proof of the generalized 
orthogonality property consists of a lengthy, brute force Mathematica com- 
putation, which used 6 hours of CPU time on a Sparc20 with 224 Mb of 
memory. This article describes both the mathematics behind this result, and 
the Mathematica computation used to obtain it. 

We set the stage in Section 2 by reviewing some basic properties about both 
the standard eigenvalue problem and the octonions, as well as introducing 
our Mathematica package. We then summarize the theory of 3 x 3 octonionic 
Hermitian matrices in Section 3, pointing out that the only known proof of 
some of the results in this section involves direct computation using Mathe- 
matica. In Section 4 we give an explicit example, and we discuss our results 
in Section 5. 



2 BACKGROUND 

2.1 The Standard Eigenvalue Problem 

We begin by collecting some standard results about the standard eigenvalue 
problem. We give the details of some of the proofs in order to emphasize the 
use of both the commutativity and associativity of C. 

The eigenvalue problem as usually stated is to find solutions A, v to the equa- 



2 



tion 



Av = Xv 



(1) 



for a given square matrix A. The basic properties of the eigenvalue problem 
for n x n complex Hermitian matrices are well-understood. 

Lemma 1 An nxn complex Hermitian matrix A has n eigenvalues (counting 
multiplicity) , all of which are real. 

PROOF. We give here only the proof that the eigenvalues are real. Let A, 
v, A satisfy (1), with A* = A. Then 

Jv^v = (Av) ] v = v ] Av = Xv ] v (2) 
so that if v 7^ we have v^v ^ 0, which forces A = A. □ 

Lemma 2 Eigenvectors of an nxn complex Hermitian matrix A correspond- 
ing to different eigenvalues are orthogonal. 

PROOF. For m — 1,2, let v rn be an eigenvector of A = A^ with eigenvalue 
X rn . By the previous lemma, X rn G R. Then 

Xiv\v 2 = (Av l )^v 2 = v[Av 2 = X 2 v\v 2 (3) 
Then either Ai = A 2 or v\v 2 = 0. □ 

Lemma 3 For any nxn complex Hermitian matrix A, there exists an or- 
thonormal basis of C™ consisting of eigenvectors of A. 

PROOF. If all eigenvalues have multiplicity one, the result follows from the 
previous lemma. But the Gram-Schmidt orthogonalization process can be used 
on any eigenspace corresponding to an eigenvalue with multiplicity greater 
than one. □ 

These lemmas are equivalent to the standard result that a complex Hermitian 
matrix can always be diagonalized by a unitary transformation. It is important 
for what follows to realize that the form of the proofs given above relies on 
both the commutativity and the associativity of C. 
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Combining the above results, it is easy to see that any (complex) Hermitian 
matrix A admits a decomposition in terms of an orthonormal basis of eigen- 
vectors. 

Theorem 4 Let A be an n x n complex Hermitian matrix. Then A can be 
expanded as 

n 

A = J2 ^mVmVln (4) 
m=l 

where {v m ; m — 1, n} is an orthonormal basis of eigenvectors corresponding 
to eigenvalues X m . 

PROOF. By the previous lemma, there exists an orthonormal basis {v m } of 
eigenvectors. It then suffices to check that 

n 

^mVmV^Vk = X k V k (5) 

m=l 

But this follows by direct computation using orthonormality. □ 

Furthermore, the set of eigenvalues {A m } is unique, and the (unit) eigenvectors 
are unique up to unitary transformations in the separate eigenspaces (which 
reduce to multiplication by a complex phase for eigenvalues of multiplicity 
one). 

2.2 Octonions 

The quaternions HI double the dimension of the complex numbers by adding 
two additional square roots of —1, usually denoted j and k. The multiplication 
table follows from 

i 2 = j 2 = k 2 = — 1 ij = k = —ji (6) 

and associativity; note that H is not commutative. Equivalently, EI can be 
viewed as the sum of 2 copies of the complex numbers 

H = C + kC (7) 

with j being defined by j = ki. 
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Fig. 1. The representation of the octonionic multiplication table using the 7-point 
projective plane. Each of the 7 oriented lines gives a quaternionic triple. 

The octonions O in turn can be viewed as the direct sum of two copies of the 
quaternions Q 

O = H + M£ = (C + AC) + (C + kC)£ (8) 

where I is yet another square root of —1. The octonions are thus spanned by 
the identity element 1 and the 7 imaginary units {i,j,h,kl,j£,i£,£}. These 
units can be grouped into (the imaginary parts of) quaternionic subspaces 
in 7 different ways; these will be referred to as "triples". Any three of these 
imaginary units which do not lie in a such a triple anti-associate. The multipli- 
cation table can be neatly summarized by appropriately labeling the 7-point 
projective plane, as shown in Figure 1. 

Even though the octonions are not associative, since any 2 octonions lie in 
a quaternionic subspace, products involving only 2 different octonions (and 
their octonionic conjugates) do associate. For example, 

p(pq) = p 2 q (9) 
which is a weak form of associativity known as alternativity . 



This construction of a new division algebra from 2 copies of another is a special 
case of the Cayley-Dickson process; for modern treatments, see [2-6]. 
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We use the notation a to denote the (octonionic) conjugate of the octonion a, 

\a\ 2 :=aa (10) 

to denote the squared norm of a, to denote the (octonionic) Hermitian 
conjugate of the matrix A, 

[a, b] := ab — ba (11) 

to denote the commutator of a and b, and 

[a,b,c] := (ab)c- a(bc) (12) 

to denote the associator of a, b, c. Both the commutator and the associator 
are purely imaginary, totally antisymmetric, and change sign if any one of 
their arguments is replaced by its conjugate. Another octonionic product with 
these properties is given by the associative 3-form [2,7] 

$(a, b, c) = ^Re(a(6c) - c(ba)) (13) 

which reduces to the vector triple product when a, b, c are imaginary quater- 
nions. 

<S>(a,b,c) = ^Re([a,b]c) (14) 
2.3 Mathematica Package 

We needed a way to easily manipulate octonions and octonionic matrices - 
it is quite difficult to unlearn associativity! There are 2 complementary ap- 
proaches, depending on whether it is desired to manipulate abstract octonions 
or whether an explicit basis can be used. For our purposes, it was initially quite 
sufficient to work with an abstract basis: We define an octonion to be a list 
with 8 elements 



a = ai + a 2 i + a 3 j + a 4 k — a 5 k£ — a 6 j£ — a 7 i£ + a 8 £ 

= {a 1 ,a 2 ,a 3 ,a 4 ,a 5 ,a 6 ,a 7 ,a 8 } (15) 

where the signs are conventional, and where we have emphasized the role of 
the imaginary units as "basis vectors" by writing them in boldface. Octonionic 
multiplication can then be expressed as a series of rules of the form 

i* j = k (16) 
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In[8]:= Omult [03, bar [03] ] 

Out [8]= Ci + c\ + c\ + c| + cjj 

ln[9]:= Expand [assoc[01, 02, 03]] 

Out [9] = 2 & 2 b 3 c 8 kl 

In [10]:= Expand [5 [01, 02, 03]] 

Out [10]= a 2 b 3 c 4 

In[ll]:= S[i, j, k] 
Out [11]= 1 

Fig. 2. A sample Mathematica session illustrating basic manipulation of octonions 
using an explicit basis representation, where Ol, 02, 03 are defined as in (18). 

where we have introduced the symbol * to represent octonionic multiplication. 
Octonionic conjugation can be defined as a series of rules of the form 

i = -i (17) 

and everything else can be defined in terms of these 2 basic operations. 

Furthermore, as already stated, computations involving small numbers of oc- 
tonions can be dramatically simplified. For instance, 3 arbitrary octonions can 
be assumed to take the form 

01 = Ol + (22 i 

02 = b l + b 2 i + b 3 j (18) 

03 = ex + c 2 i + c 3 j + c 4 k + c 8 £ 



When implementing these ideas using Mathematica, it turned out to be more 
efficient to define the 2 fundamental operations directly on lists, rather than 
building them up in terms of rules. For instance, conjugation is more easily 
defined by 

bar[{xl_, x2_, x3_, x4_, x5_, x6_, x7_, x8_}] := 

{xl, -x2, -x3, -x4, -x5, -x6, -x7, -x8} (19) 

and there is an analogous definition of octonionic multiplication, called Omult. 
A illustration of computation in an explicit basis appears in Figure 2. 
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In [12] : = 


Omult[a, b] 




Out [12] = 


a * b 




In [13] : = 


bar [Omult [a, b] ] 




Out [131= 


b * a 




In [14] : = 


Omult [a, bar [a]] 




Out [14]= 


1 a | 2 




In [15] : = 


assoc[a, b, c] 




Out [15] = 


-(a* (b * c) ) + (a* 


b) * c 


In [16] : = 


S[a, b, c] 




Out [16] = 


~ (a* (5 * c) - c * 


(b * a) - ( a * b) * c + ( c * b) * a) 



Fig. 3. A sample Mathematica session illustrating basic manipulation of abstract 
octonions. 

Matrices can now be constructed as nested lists, and operations such as matrix 
multiplication can easily be defined. However, matrix expressions involving 
several octonions quickly become unwieldy. We therefore supplemented the 
above basic definitions in terms of an explicit basis with an alternative set of 
definitions using abstract octonions. In the process, we took full advantage of 
the formatting capabilities in Mathematica 3.0. An illustration of computation 
using the abstract approach appears in Figure 3. 

Even though we chose not to implement the concrete version of the basic 
octonionic operations in terms of rules, Mathematical ability to manipulate 
such rules was crucial in constructing the abstract version. Especially when 
teaching Mathematica to both manipulate and print matrices of lists (i.e. oc- 
tonions), the ability to easily modify the code to recognize special cases proved 
extremely helpful. (Octonions are closely related to Clifford algebras] see for 
instance [8], in which the representation theory of Clifford algebras is extended 
so as to employ octonions. There are a number of packages for manipulating 
Clifford algebras, some of which are described in [9] . Several of these can han- 
dle octonions, such as the program CLICAL by Pertti Lounesto [10] and the 
Maple package CLIFFORD by Rafal Ablamowicz [11], both of which intro- 
duce the octonions as paravectors over an appropriate Clifford algebra [12]. 
There is also an older Maple package Octonion (and the companion package 
Clifford) by Jorg Schray [13].) 
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3 3x3 OCTONIONIC HERMITIAN MATRICES 



It is not immediately obvious that 3x3 octonionic Hermitian matrices have 
a well-defined determinant, let alone a characteristic equation. We therefore 
first discuss some of the properties of these matrices before turning to the 
eigenvalue problem. 

3.1 Jordan Matrices 

The 3x3 octonionic Hermitian matrices, henceforth referred to as the Jordan 
matrices, form the exceptional Jordan algebra under the Jordan product 

Ao B :=-{AB + BA) (20) 

which is commutative, but not associative. A special case of this is 

A 2 = AoA (21) 

and we define 

A 3 := A 2 o A = A o A 2 (22) 

Remarkably, with these definitions, Jordan matrices satisfy the usual charac- 
teristic equation (see e.g. [2]) 

A 3 - (tr A) A 2 + a{A) A - (det A) I = (23) 
where cr(A) is defined by 

:= \ ((tr A) 2 -tr (A 2 )) (24) 

and where the determinant det A of A is defined abstractly in terms of the 
Freudenthal product [14,15] 

A*B = AoB- ^(Atr(B) +Btr{A)) + Vtr (A) tr (B) - tr (Ao B)) 

(25) 
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In [10] : = 



A = MakeHMatrix [ {p, m, n, a, b, c} ] 



Out [10] = 

In[ll] : = 
Out [111 = 



p a b' 
a m c 

, b c n , 

Tr [A] 

m + n + p 



In [12]:= JDet[A] 

Out [12]= mnp + b* (a * c) + (e * a) * b - n |a| 2 - m |b| 2 - p | c | 2 



In[13]:= A = A / . {a-»01, b->02, c-»03} 



Out [13]= 



p ai + a 2 i 

ai - a 2 i m 
I bi + b 2 i + b 3 j Ci - c 2 i - c 3 j - c 4 k 



bi - b 2 i - b 3 j 
Ci + c 2 i + c 3 j + c 4 k + c 8 1 
c 8 1 n 



In [14]:= JDet[A] 

Out [14]= 2 ai bi Ci - 2 a 2 b 2 ci - 2 a 2 bi c 2 - 2 a 1 b 2 c 2 - 2 a x b 3 c 3 + 2 a 2 b 3 c 4 - b 2 m ■ 
b 2 m-b 3 m-afn-a 2 n-c 2 p-c 2 p-c 3 p-cip-c|p + mnp 



Fig. 4. A Mathematica session showing the calculation of the determinant for a 
generic Jordan matrix, both abstractly and in terms of an explicit basis. 



which leads to 

det(A) = -ti((A* A) o A) (26) 



Concretely, if 



A = 



( V 


a 


b 


a 


m 


c 


u 


c 


n 



(27) 



with p, m, n G K. and a, 6, c G O, then 



tr^ = p + m + n (28) 

cr(v4) =pm + + mn — |a| 2 — |6| 2 — |c| 2 (29) 

det A = pmn + b(ac) + b(ac) — n|a| 2 — m\b\ 2 — p\c\ 2 (30) 

The determinant calculation is illustrated in Figure 4. 



10 



3.2 The Real Eigenvalue Problem 



Each division algebra can be rewritten as a real matrix algebra of appropriate 
dimension (see e.g. [1]). Under this identification, a Hermitian matrix over 
any of the division algebras becomes a real symmetric matrix. It is therefore 
clear that a 3 x 3 octonionic Hermitian matrix must have 8 x 3 = 24 real 
eigenvalues [16]. However, as we now discuss, instead of having (a maximum 
of) 3 distinct real eigenvalues, each with multiplicity 8, there appear to be (a 
maximum of) 6 distinct real eigenvalues, each with multiplicity 4. 

The reason for this is that, somewhat surprisingly, a (real) eigenvalue A of a 
Jordan matrix A does not in general satisfy the characteristic equation. To 
see this, consider the eigenvalue equation (1), with A as in (27), Ael, and 



x 



where v 



y . Assuming without loss of generality that z ^ 0, explicit 



computation yields [1] 



det(AI - A) 



z = 



A 3 - (trA) A 2 + a(A) A - det A 
b(a(czfj +c(a(bzfj — b(ac) + (ca)b 



(31) 



If a, b, c, and z associate, the RHS of (31) vanishes, and A does indeed satisfy 
the characteristic equation (23); this will not happen in general. However, 
since the LHS of (31) is a real multiple of z, this must also be true of the 
RHS, so that 



b(a(cz)^J + c (a(bzfj — b(ac) + (ca)b 



z = rz 



r 6R (32) 



which can be solved to yield a quadratic equation for r as well as constraints 
on z. 



Theorem 5 (Dray &z Manogue [1]) The real eigenvalues of the 3 x 3 oc- 
tonionic Hermitian matrix A satisfy the modified characteristic equation 



det(A7 - A) = A 3 - (trA) A 2 + a (A) A - det A = r 



(33) 



where r is either of the two roots of 



r 2 + 4$(a, b, c) r - [a,b,c 



= 



(34) 



with a, b, c as defined by (27) and where $ was defined in (H). 
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Furthermore, provided that [a, b, c] 7^ 0, each of x, y, and z can be shown to 
admit an expansion of the form (given for z only) 

Corollary 6 With A and r as above, and assuming [a, b, c] ^ 0, 



with a, (3, 7, 8 e K. 

PROOF. Both of these results were obtained using Mathematica to solve 
(32) by brute force for real r and octonionic z given generic octonions a, b, c. 
An outline of the computation appears in Figure 5. □ 

The real parameters a,(3,^f,S may be freely specified for one (nonzero) compo- 
nent, say z; the remaining components x,y have a similar form. 

The solutions of (32) are real, since the corresponding 24 x 24 real symmetric 
matrix has 24 real eigenvalues. We will refer to the 3 real solutions of (32) 
corresponding to a single value of r as a family of eigenvalues of A. There 
are thus 2 families of real eigenvalues, each corresponding to a 4 independent 
(over M) eigenvectors. 

We note several intriguing properties of these results. If A is in fact complex, 
then the only solution of (34) is r = 0, and we recover the usual characteristic 
equation with a unique set of 3 (real) eigenvalues. If A is quaternionic, then 
one solution of (34) is r = 0, leading to the standard set of 3 real eigenvalues 
and their corresponding quaternionic eigenvectors. However, unless a, b, c 
involve only two independent imaginary quaternionic directions (in which case 
$(a, b, c) = = [a, b, c]), there will also be a nonzero solution for r, leading 
to a second set of 3 real eigenvalues. Finally, if A is octonionic (so that in 
particular [a, b, c] ^ 0), then there are two distinct solutions for r, and hence 
two different sets of real eigenvalues, with corresponding eigenvectors. Note 
that if det A = ^ [a, b, c] then all of the eigenvalues of A will be nonzero! 

3. 3 Orthogonality 

The final surprise lies with the orthogonality condition for eigenvectors v,w 
corresponding to different eigenvalues. It is not true that v'w = 0, although 
the real part of this expression does vanish [1]. But, at least in the 2x2 case 
[1], it is straightforward to show that what is needed to ensure a decomposition 



z = (aa + (3b + 7c + 5) 1 + 

V 



[a, b, c] r 
2 

[a, b, c] 



(35) 
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In[10]:= LHS : = Omult[b, Omult[a, Omult[c, z] ] ] + 

Omult [bar [c] , Omult [bar [a] , Omult [bar [b] , z] ] ] - 

Omult [Omult [b, Omult[a, c] ] + Omult [Omult [bar [c] , bar[a]], bar[b]], z] 

In [11] : = RHS : = r z 

In [12]:= Eq : = LHS - RHS 

In [13] : = LHS == RHS 

Out [13]= b * (a* (c * z) ) + c * (a* (b * z) ) - (b * (a*c) + (c * a) * b) *z ==rz 
In [14] := a : = 01 ; b : = 02 ; c : = 03 ; z : = oz ; 

In[15]:= ZRule = Solve [Expand[Eq[[{5, 6, 7, 8}]]] ==0, z[[{5, 6, 7, 8}J] 

r r 2 a 2 b 3 c 8 Zi 2 a 2 b 3 c 8 z 2 2 a 2 b 3 c 8 z 3 2 a 2 b 3 c 8 z 4 , , 

Out [15]= z 5 -> - , z 6 => - , z 7 => , z 8 => 

1 1 4 a 2 b 3 c 4 + r 4 a 2 b 3 c 4 + r 4 a 2 b 3 c 4 + r 4 a 2 b 3 c 4 + r ' ' 

In [16]:= Oz : = z / . ZRulepj 

In[17]:= RRule = Simplify [ Solve [ == Expand [Eq / . MakeRule [z, Oz] ] , r] ] 

Out [17]= {{r -» 2 a 2 b 3 (-c 4 + ^fcj + c| j } , {r ^ -2 a 2 b 3 (c 4 + cj + c 2 a j }} 

ln[18] := Req = Expand[r A 2 + 4 RS[a, b, c] r - onormsq[assoc[a, b, c] ] ] 

Out [18]= -4 a 2 b 3 c 8 + 4 a 2 b 3 c 4 r + r 2 
In[19]:= Expand[Req /. RRule] 

Out[19]= {0, 0} 

Expand [as soc [a, b, c] ] r 

in [20] := Pz = Omult (aa + /3b + yc + c5 1 ), 1 + ; 

Expand[onormsq[assoc[a, b, c] ] ] 

In[21]:= PRule = Solve [0 == Oz - Pz /. RRule[[l]] , z[[{l, 2, 3, 4}]]] 

Out [21]= {{zi ^ ai a + bi /3 + cj y + 6, z 2 ^ a 2 a + b 2 + c 2 y , z 3 -> b 3 /3 + c 3 y, z 4 -» ^/c| + c| y} } 



Fig. 5. An outline of the derivation of the modified characteristic equation using 
Mathematica. We first construct to be (32), then solve half these equations for 
some of the components of z in terms of the others (this is ZRule) . We then solve 
the remaining equations for r (this is RRule) and verify that the solutions have the 
form claimed in Theorem 5, i.e. that they satisfy (34) (which is Req). Finally, we 
verify that z has the form claimed in (35) (this is PRule). 

of the form (4) is the following generalized notion of orthogonality, which does 
in fact hold. 

Definition 7 (Dray &z Manogue [1]) Let v and w be two octonionic vec- 
tors. We will say that w is orthogonal to v if 

( vv i) w = (36) 



In the 3x3 case, a lengthy, direct computation verifies that eigenvectors with 
different real eigenvalues satisfy (36) provided that the same value of r is used 
for both eigenvectors. 
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Theorem 8 (Dray & Manogue [1]) Ifv andw are eigenvectors of the 3x3 
octonionic Hermitian matrix A corresponding to different real eigenvalues in 
the same family (same r value), then v and w are mutually orthogonal in the 
sense of (36). 

PROOF. The modified characteristic equation (33) can be used to eliminate 
cubic and higher powers of A from any expression. Furthermore, given two 
distinct eigenvalues Ai ^ A2, subtracting the two versions of (33) and factoring 
the result leads to the equation 

(\j + AiA 2 + A*) - (tr A) (A x + A 2 ) + a(A) = (37) 

which can be used to eliminate quadratic terms in one of the eigenvalues. 
We used Mathematica to implement these simplifications in a brute force 
verification of (36) in this context, which used 6 hours of CPU time on a SUN 
Sparc20 with 224 Mb of RAM. A summary of the computation appears in 
Figures 6 and 7. □ 

For Jordan matrices, we thus obtain two decompositions of the form (4), 
corresponding to the two sets of real eigenvalues. For each, the eigenvectors 
are fixed up to orthogonal transformations which preserve the form (35) of z. 

Theorem 9 (Dray &z Manogue [1]) Let A be a 3x3 octonionic Hermitian 
matrix. Then A can be expanded as in (4), where {v±, v 2 , v 3 } is an orthonormal 
basis, as per (36), of eigenvectors of A corresponding to the real eigenvalues 
X m , which belong to the same family (same r value). 

PROOF. Fix a family of real eigenvalues of A by fixing r. If the eigenval- 
ues are distinct, then the previous theorem guarantees the existence of an 
orthonormal basis of eigenvectors, and the result follows. 
If the eigenvalues are the same, the family consists of a single real eigenvalue 
A with multiplicity 3. Then tr (A) = 3A and a {A) = 3A 2 . Writing out these 
two equations in terms of the components (27) of A, and inserting the first 
into the second, results in a quadratic equation for A; the discriminant D of 
this equation satisfies D < 0. But A is assumed to be real, which forces D = 0, 
which in turn forces A to be a multiple of the identity matrix, for which the 
result holds. 

The remaining case is when one eigenvalue, say /i, has multiplicity 2 and one 
has multiplicity 1. Letting v be a (normalized) eigenvector with eigenvalue /x, 
consider the matrix 

X = A - a vv ] (38) 
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In[17]:= 

ZO := Expand[Oz Denominator [Oz [ [8] ]] ] 

In[18]:= 

X[i_] :=Omult [a, Omult [c, ZO] ] + 

(q[i]-m) Omult [bar [b] , ZO] ; 
Y[i_] :=Expand [Omult [bar [a] , Omult [bar [b] ,Z0] ]+ 

(q[i]-p) Omult [c,Z0]]; 
Z[i_] :=Expand[ 

((q[i]-p) (q[i]-m)-onormsq[a] ) ZO] 

In[21]:= 

V[i_] :=MakeVector [{X[i] ,Y[i] ,Z[i] }] 

In[22]:= 

{rl,r2}=r/.RRule; 

r2Rule=r A 2->Expand[ (rl+r2) *r-rl*r2] ; 
r3Rule=r A 3->Expand [Expand [ 

r (r A 2/.r2Rule) ]/.r2Rule] ; 

In[25]:= 

qRule = q[2] A 2 -> Expand[ 

(Tr [A] [[l]]-q[l]) (q[l] +q[2] ) -sigma [A] [[1]]]; 
q3Rule = q[l] A 3 -> Expand [Tr [A] [ [1] ]q[l] A 2— 

sigma [A] [ [1] ] q[l] +Jdet [A] +r] ; 
q4Rule = q[l] A 4 -> Expand [Expand [ 

q[l]*(q[l] A 3 /. q3Rule)] /. q3Rule] ; 

In[28]:= 

ZeroVector==Expand [Expand [Expand [ 

MMult [A, V[l] ]-q[l]V[l] ] / . q3Rule] / . r2Rule] 

Out [28]= 

True 

In[29]:= 

W=Expand [Expand [Expand [MSq[V[l] ] ]/. {q4Rule, q3Rule} ] 
/. {r3Rule, r2Rule}] ; 

Fig. 6. The construction of eigenvectors V[i] of A with different real eigenvalues q[i}. 
The characteristic equation (33) is implemented via q3Rule; the condition (14) on r 
is given by r2Rule, and qRule is the extra condition (37). Finally, VV = V[l] V[l]^. 

with aeK. For most values of a, X will have 3 distinct real eigenvalues, whose 
eigenvectors will be orthogonal by the previous theorem. But this means that 
eigenvectors of X are also eigenvectors of A; the required decomposition of A 
is obtained from that of X simply by solving (38) for A. □ 

Note in particular that for some quaternionic matrices with determinant equal 
to zero, one and only one of these two decompositions will contain the eigen- 
value zero. 
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In[7]:= 

WWO=Expand [MMult [ W, W] ] ; 

In[9]:= 

Clear [W, W] 

In[10]:= 

wwoil=wwo [[1,1]]; 

WW021=WW0 [[2,1]]; 
WW031=WW0 [[3,1]]; 
Clear [WWO] ; 

In[15]:= 

WWll=Expand [Expand [ 

WWO H/.{q4Rule,q3Rule}]/.{r3Rule,r2Rule}]; 
Clear [WWOll] ; 

In[19]:= 

WW21=Expand [Expand [ 

WW021/. {q4Rule,q3Rule}] /. {r3Rule, r2Rule} ] ; 
Clear [WW021] ; 

In[23]:= 

WW31=Expand [Expand [ 

WW031/. {q4Rule,q3Rule}] /. {r3Rule, r2Rule} ] ; 
Clear [WW031] ; 

In[27]:= 

WW=MakeVector[{WWll,WW21,WW31}] ; 

In[30]:= 

WW===ZeroVector 



Out [30]= 
True 



In[33]:= 

TimeUsed[]/3600. 



Out[33]= 

5.90069 



Fig. 7. A summary of the Mathematica 2.2 computation used to show that eigen- 
vectors V, W of Jordan matrices with different real eigenvalues are orthogonal in 
the generalized sense. Note that each term was simplified separately. 
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4 Example 



Let s be given by 



s = cos 9 + k£ sin 9 



(39) 



and consider the matrix 



B 



( P 

—iq 

y —kqs 



iq kqs \ 
-jq p J 



(40) 



noting that B is quaternionic if 9 = 0. Turning first to the equation for r, (32) 
becomes 



r 2 + Aq 3 r cos 9 - 4g° sin z 9 = 



6„- 2 



(41) 



with solutions 



r± = -2g 3 (cos^±l) 



(42) 



Since 



tr B = 3p 
a(B) = 3(p 2 -q 2 ) 
det B=p 3 - 3pq 2 + 2q 3 cos 6> 

the eigenvalue equation (33) becomes 



(43) 
(44) 
(45) 



= A 3 - 3p A 2 + 3(p 2 -q 2 )\- (p 3 - 3pq 2 T 2q 3 ) 
= (X-pTq) 2 (X-p±2q) 



(46) 
(47) 



An orthonormal basis of eigenvectors associated with these eigenvalues is 



X u = p ± q : u± = 



X v = p±q : v± 



o (f u R±) 
( i 

2ks | (f v R ± ) 
\ i 



(48) 



(49) 
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In[34]:= ks :=Omult[k, Cos[6] 1 +Sin[6] kl] 



o o 

In [35]:= R + : = Sin [ — ] 1 + Cos [ — ] kl; 

In[36] := B = MakeHMatrix [ {p, p, p, q*i, -q*ks, q*j}] 

' p qi q Cos [9] k - q Sin [6] 1 

out [36]= -qi p qj 

, -q Cos [9] k + q Sin [9] 1 -q j p 
in [37]:= A* : = p + q; : = p + q; A; : = p - 2 q; 



In [38]:= u + = SMultR [MakeVector [ { i , ozero, j}], Omult[l , R, ] ] 



Out [38]-- 



{ Sin[f ] i + Cos[f] jl 


Cos[f-] il + Sin[|-] j 



ln[39]:= v + = Simplify [SMultR [MakeVector [{ j , 2ks, i}], Omult[i, R + ] ] ] 

-Sin[ f ] k - Cos [-§-] 1 



Out [39]= 



2 Cos [f] il - 2 Sin[f-] j 



-Sin[- 



■ Cos [ - 



kl 



ln[40]:= w + = Simplify [ SMultR [MakeVector [{ j , -ks, i}], Omult[j, R» ] ] ] 



Out [40]- 



-Sin[f ] -Cos[f ] kl \ 
-Sin[|-] i - Cos [-§-] jl 



i Sin[f ] k + Cos[f ] 1 

In[41]:= {Simplify [MMult [B, u t ] - X* u u, ] == ZeroVector, 
Simplify [MMult [B, v + ] - v + ] == ZeroVector, 
Simplify [MMult [B, w, ] - A* w + ] == ZeroVector} 

Out [41]= {True, True, True} 

In [42]:= Simplify [MMult [dagger [u, ] , v + ] ] 

Out[42]= -Sin[0] il + (-l-Cos[8]) j 

In [43]:= MSq[x_] : = MMult [x, dagger [x] ] 

In [44]:= Simplify [MMult [MSq[u + ] , v + ] ] == ZeroVector 

Out [44]= True 

In[45]:= Simplif y [X^ MSq [ ] +A^MSq[-^— ] +A;MSq[-^— ]] == B 

V2 V~6 V~3 

Out[45]= True 

Fig. 8. A Mathematica computation illustrating that the vectors u + , v+, w + given 
in (48)-(50) are indeed eigenvectors of the Jordan matrix B given in (40) with the 
given eigenvalues (with /„ = 1, f v = i, and f w = j), that these eigenvectors are 
only orthogonal in the generalized sense of Theorem 8, and that they lead to a 
decomposition of B as implied by Theorem 9. (Normalization factors have been 
added in the final computation, since the given vectors are not unit vectors.) 
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X w = p =f 2q : w± = \ -ks \ (f w R±) (50) 

where f u , f v , f w are arbitrary linear combinations of a = iq, b = —kqs, c = jq, 
i.e. 

fujvjw e (l,a,b,c) (51) 

and where R± is given by 

f sin | + U cos | 

^ = e hf ■ e ( 52 ) 
[ cos I — K-t sin | 

Note that in the limiting case — > 0, each / is quaternionic, and i?_ reduces 
to 1 while R + becomes hi. 

As expected, provided one fixes a family of eigenvectors and eigenvalues arising 
from a given choice of r, these eigenvectors satisfy the orthogonality property 
(36) and thus lead to a decomposition of the form (4). A partial verification 
of this using Mathematica is given in Figure 8. 



5 Discussion 



There are 2 quite different surprising aspects of our work: the mathematical 
changes needed to extend the eigenvalue problem to the octonions, and the 
fact that we have only been able to prove one of our key results using computer 
algebra. We discuss each of these in turn. 

It is of course intriguing that the eigenvalue problem over the octonions 
changes so much, for instance in that there are unexpectedly many real eigen- 
values. But we find it remarkable that so much of the standard structure re- 
mains, provided it is reinterpreted appropriately. The most striking example 
of this is the need to generalize what is meant by orthogonality. 

We can relate our notion of ort honor mality to the usual one by noting that a 
basis of O n which is orthonormal in the sense (36) satisfies 

vv^ + ... + ww^ = I (53) 



which follows directly from the definition. If we define a matrix Q whose 
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columns are just v, w, then this statement is equivalent to 

QQ 1 = I 



(54) 



Over the quaternions, left matrix inverses are the same as right matrix inverses, 
and we would also have 

Q ] Q = I (55) 

or equivalently 

v*v = 1 = ... = w^w; v^w = = ... (56) 

which is just the standard notion of orthogonality. These two notions of or- 
thogonality fail to be equivalent over the octonions; we have been led to view 
the former as more fundamental. 

Turning to our proof-by-computer, we reiterate that the only proof we cur- 
rently have of our main orthogonality result, namely Theorem 8, uses Mathe- 
matica to explicitly perform a horrendous, but exact, algebraic computation. 
While one could hope for a more elegant mathematical proof of this result, 
the Mathematica computation nevertheless establishes a result which would 
otherwise remain for the moment merely a conjecture. This is a good example 
of being able to use the computer to verify one's intuition when it may not be 
possible to do so otherwise. 

But even more is true: Throughout our work with the octonions, the ability to 
manipulate octonionic expressions quickly and accurately has been crucial in 
developing our intuition in the first place. We do not feel that we would have 
been able to reach anything like our current understanding of the applications 
of the octonions to physics, on which we continue to be working actively, 
without the availability of a package such as the one described here. 

Finally, this computation was initially done in 1996 using Mathematica 2.2. 
While preparing this paper, we attempted to reproduce the computation using 
Mathematica 3.0 — and couldn't! Even for Mathematica 2.2, it was necessary 
to massage the computation by hand in order to succeed. One way this was 
done (between Figure 6 and Figure 7) was by saving some intermediate steps 
to files and then restarting the kernel. Another technique was not to simplify 
all the components of an expression at the same time. For instance, in Figure 7, 
the vector WW = (VVt)W contains 3 octonions, each of which requires roughly 
8 Mb. 

Comparing the computations both versions of Mathematica could handle, 
Mathematica 3.0 appears to require nearly 4 times as much CPU time for 
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the same computation; this was for identical inputs, with a minimum of spe- 
cial formatting. It is unfortunate that the many nice features of Mathematica 
3.0 appear to require such a high price. 
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